Zhi Zheng

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Jan 29, 2026
Via arXiv

Token-level Collaborative Alignment for LLM-based Generative Recommendation

Jan 26, 2026
Via arXiv

Self-Manager: Parallel Agent Loop for Long-form Deep Research

Jan 25, 2026
Via arXiv

VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit

Jan 09, 2026
Via arXiv

DynaDebate: Breaking Homogeneity in Multi-Agent Debate with Dynamic Path Generation

Jan 09, 2026
Via arXiv

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Nov 09, 2025
Via arXiv

A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning

Sep 26, 2025
Via arXiv

PerchMobi^3: A Multi-Modal Robot with Power-Reuse Quad-Fan Mechanism for Air-Ground-Wall Locomotion

Sep 16, 2025
Via arXiv

GLEAM: Learning to Match and Explain in Cross-View Geo-Localization

Sep 09, 2025
Via arXiv

TextOnly: A Unified Function Portal for Text-Related Functions on Smartphones

Aug 23, 2025
Via arXiv